Abstract

We consider a bidirectional network in which two users exchange information with the help of a buffer-aided relay. In such a network without a direct link between user 1 and user 2, there exist six possible transmission modes, i.e., four point-to-point modes (user 1-to-relay, user 2-to-relay, relay-to-user 1, relay-to-user 2), a multiple access mode (both users to the relay), and a broadcast mode (the relay to both users). Because of the buffering capability at the relay, the transmissions in the network are not restricted to adhere to a predefined schedule, and therefore, all the transmission modes in the bidirectional relay network can be used adaptively based on the instantaneous channel state information (CSI) of the involved links. For the considered network, assuming fixed transmit powers for both the users and the relay, we derive the optimal transmission mode selection policy which maximizes the sum rate. The proposed policy selects one out of the six possible transmission modes in each time slot based on the instantaneous CSI. Simulation results confirm the effectiveness of the proposed protocol compared to existing protocols.

I. Introduction 



In a bidirectional relay network, two users share information via a relay node [1]. Several protocols have been proposed for such a network under the practical half-duplex constraint, i.e., a node cannot transmit and receive at the same time and in the same frequency band. The simplest protocol is the traditional two-way protocol in which the transmission is accomplished in four successive point-to-point phases: user 1-to-relay, relay-to-user 2, user 2-to-relay, and relay-to-user 1. In contrast, the time division broadcast (TDBC) protocol exploits the broadcasting capability of the wireless medium and combines the relay-to-user 1 and relay-to-user 2 phases into one phase, the broadcast phase [2]. Thereby, the relay broadcasts a superimposed codeword, carrying information for both user 1 and user 2, such that each user is able to recover its intended information by self-interference cancellation. To further increase the spectral efficiency, the multiple access broadcast (MABC) protocol was proposed in which the user 1-to-relay and user 2-to-relay phases are also combined into one phase, the multiple-access phase [3]. In the multiple-access phase, user 1 and user 2 simultaneously transmit to the relay, which decodes both messages. For the bidirectional relay network without a direct link between user 1 and user 2, the capacity regions of all six possible transmission modes, i.e., the four point-to-point modes, the multiple access mode, and the broadcast mode, are known [4], [5]. Using this knowledge, a significant research effort has been dedicated to obtaining the achievable rate region and the capacity of the bidirectional relay network with and without fading [1]-[8]. The average capacity of the bidirectional relay network without a direct link between user 1 and user 2 has been obtained for additive white Gaussian noise (AWGN) channels without fading, whereas when fading is present, only achievable rates are known. The reason for this is the fixed schedule of using the transmission modes, which is adopted in all existing protocols and does not exploit the instantaneous channel state information (CSI) of the involved links. For one-way relaying, an adaptive link selection protocol was proposed in [9] where, based on the instantaneous CSI, in each time slot, either the source-relay link or the relay-destination link is selected for transmission. To this end, the relay has to have a buffer for data storage. This strategy was shown to achieve the capacity of the one-way relay channel with fading [10].

Fig. 1. Three-node bidirectional relay network consisting of two users and a relay.

Motivated by the protocols in [9] and [10], our goal is to utilize all available degrees of freedom in the three-node half-duplex bidirectional relay network with fading, via a buffer-aided and adaptive mode selection protocol.
In particular, given the fixed transmit powers of all three nodes in the bidirectional relay network, we find a policy
which selects the optimal transmission mode from the six available modes in each time slot such that the sum
rate is maximized. A similar problem was considered in [8]. However, the selection policy in [8] does not use all
possible modes, i.e., it only selects between two point-to-point modes and the broadcast mode, and assumes that the
transmit powers of all three nodes are identical. We will show that considering only the three modes in [8] is not
optimal. In fact, the multiple access mode is selected with non-zero probability in the proposed optimal selection 
policy. Finally, we note that the advantages of buffering come at the expense of an increased end-to-end delay. 
Therefore, the proposed protocol is suitable for applications that are not sensitive to delay. The delay analysis of 
the proposed protocol is beyond the scope of the current work and is left for future research. 

The remainder of the paper is organized as follows. In Section II, the system model is presented. In Section III, the average sum rate is investigated and the optimal mode selection policy is provided. Simulation results are presented in Section IV. Finally, Section V concludes the paper.

II. System Model 

In this section, we first describe the channel model and review the achievable rates for the six possible transmission 
modes. 
Fig. 2. The six possible transmission modes in the considered bidirectional relay network.



A. Channel Model 

We consider a simple network in which users 1 and 2 exchange information with the help of a relay node, as shown in Fig. 1. We assume that there is no direct link between user 1 and user 2, and thus, user 1 and user 2 communicate with each other only through the relay node. We assume that all three nodes in the network are half-duplex. Furthermore, we assume that time is divided into slots of equal length and each node transmits codewords which span one time slot except in the multiple access mode in which the nodes may transmit codewords which span a fraction of the time slot. The duration of one time slot is normalized to one. We assume that the users-to-relay and relay-to-users channels are all impaired by AWGN having zero mean and unit variance, and block fading, i.e., the channel coefficients are constant during one time slot and change from one time slot to the next. Moreover, the channel coefficients are assumed to be reciprocal such that the user 1-to-relay and user 2-to-relay channels are identical to the relay-to-user 1 and relay-to-user 2 channels in each time slot, respectively. Let $h_1(i)$ and $h_2(i)$ denote the channel coefficients between user 1 and the relay and between user 2 and the relay in the $i$-th time slot, respectively. Furthermore, let $S_1(i) = |h_1(i)|^2$ and $S_2(i) = |h_2(i)|^2$ denote the squares of the channel coefficient amplitudes in the $i$-th time slot. $S_1(i)$ and $S_2(i)$ are assumed to be ergodic and stationary random processes with means $\Omega_1 = E\{S_1\}$ and $\Omega_2 = E\{S_2\}$¹, respectively, where $E\{\cdot\}$ denotes expectation. Since the noise is AWGN, in order to achieve the capacity in each mode, each node has to transmit codewords which are Gaussian distributed. Therefore, the transmitted codewords of user 1, user 2, and the relay are composed of symbols which are modeled as Gaussian random variables with variances $P_1$, $P_2$, and $P_r$, respectively, where $P_j$ is the transmit power of node $j \in \{1, 2, r\}$. For ease of notation, we define $C(x) = \log_2(1 + x)$. In the following, we introduce the transmission modes and their achievable rates.
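As a minimal sketch of the channel model, the following Python snippet evaluates $C(x)$ and draws the channel power gains for one time slot. It assumes Rayleigh fading, i.e., exponentially distributed gains, which the channel model itself does not require (only ergodicity and stationarity are assumed). The function names `capacity` and `draw_channel_gains` are illustrative and not part of the paper.

```python
import math
import random


def capacity(x):
    """C(x) = log2(1 + x) in bits/symbol for receive SNR x (unit noise variance)."""
    return math.log2(1.0 + x)


def draw_channel_gains(omega1, omega2, rng):
    """Draw (S1(i), S2(i)) for one time slot.

    Assumption: Rayleigh block fading, so each gain is exponentially
    distributed with mean omega1 and omega2, respectively.
    """
    s1 = rng.expovariate(1.0 / omega1)  # expovariate takes the rate 1/mean
    s2 = rng.expovariate(1.0 / omega2)
    return s1, s2
```

With `rng = random.Random(0)`, repeated calls to `draw_channel_gains(omega1, omega2, rng)` give one independent pair of gains per time slot, which matches the block fading assumption.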

B. Transmission Modes and Their Achievable Rates 

In the considered bidirectional relay network, only six transmission modes are possible, cf. Fig. 2. The six possible transmission modes are denoted by $\mathcal{M}_1, \ldots, \mathcal{M}_6$, and $R_{jj'}(i) \geq 0$, $j, j' \in \{1, 2, r\}$, denotes the transmission rate from node $j$ to node $j'$ in the $i$-th time slot. Let $B_1$ and $B_2$ denote two infinite-size buffers at the relay in which the received information from user 1 and user 2 is stored, respectively. Moreover, $Q_j(i)$, $j \in \{1, 2\}$, denotes the amount of normalized information in bits/symbol available in buffer $B_j$ in the $i$-th time slot. Using this notation, the transmission modes and their respective rates are presented in the following:

¹In the paper, we drop time index $i$ in expectations for notational simplicity.
$\mathcal{M}_1$: User 1 transmits to the relay and user 2 is silent. In this mode, the maximum rate from user 1 to the relay in the $i$-th time slot is given by $R_{1r}(i) = C_{1r}(i)$, where $C_{1r}(i) = C(P_1 S_1(i))$. The relay decodes this information and stores it in buffer $B_1$. Therefore, the amount of information in buffer $B_1$ increases to $Q_1(i) = Q_1(i-1) + R_{1r}(i)$.

$\mathcal{M}_2$: User 2 transmits to the relay and user 1 is silent. In this mode, the maximum rate from user 2 to the relay in the $i$-th time slot is given by $R_{2r}(i) = C_{2r}(i)$, where $C_{2r}(i) = C(P_2 S_2(i))$. The relay decodes this information and stores it in buffer $B_2$. Therefore, the amount of information in buffer $B_2$ increases to $Q_2(i) = Q_2(i-1) + R_{2r}(i)$.

$\mathcal{M}_3$: Both users 1 and 2 transmit to the relay simultaneously. For this mode, we assume that multiple access transmission is used, see [5]. Thereby, the maximum achievable sum rate in the $i$-th time slot is given by $R_{1r}(i) + R_{2r}(i) = C_r(i)$, where $C_r(i) = C(P_1 S_1(i) + P_2 S_2(i))$. Since user 1 and user 2 transmit independent messages, the sum rate, $C_r(i)$, can be decomposed into two rates, one from user 1 to the relay and the other one from user 2 to the relay. Moreover, these two capacity rates can be achieved via time sharing and successive interference cancellation. Thereby, in the first $0 \leq t(i) \leq 1$ fraction of the $i$-th time slot, the relay first decodes the codeword received from user 2 and considers the signal from user 1 as noise. Then, the relay subtracts the signal component of user 2 from the received signal and decodes the codeword received from user 1. A similar procedure is performed in the remaining $1 - t(i)$ fraction of the $i$-th time slot but now the relay first decodes the codeword from user 1 and treats the signal of user 2 as noise, and then decodes the codeword received from user 2. Therefore, for a given $t(i)$, we decompose $C_r(i)$ as $C_r(i) = C_{12r}(i) + C_{21r}(i)$ and the maximum rates from users 1 and 2 to the relay in the $i$-th time slot are $R_{1r}(i) = C_{12r}(i)$ and $R_{2r}(i) = C_{21r}(i)$, respectively. $C_{12r}(i)$ and $C_{21r}(i)$ are given by

$$C_{12r}(i) = t(i)\,C_{1r}(i) + [1 - t(i)]\,[C_r(i) - C_{2r}(i)] \tag{1a}$$
$$C_{21r}(i) = [1 - t(i)]\,C_{2r}(i) + t(i)\,[C_r(i) - C_{1r}(i)]. \tag{1b}$$

The relay decodes the information received from user 1 and user 2 and stores it in its buffers $B_1$ and $B_2$, respectively. Therefore, the amounts of information in buffers $B_1$ and $B_2$ increase to $Q_1(i) = Q_1(i-1) + R_{1r}(i)$ and $Q_2(i) = Q_2(i-1) + R_{2r}(i)$, respectively.
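The time-sharing decomposition of the multiple access mode can be sketched as follows. This is an illustrative sketch under the successive interference cancellation description given for this mode: the user decoded second sees an interference-free channel, while the user decoded first gets the remainder of the sum rate. The function name `mac_rates` and its argument order are assumptions made for this example.

```python
import math


def capacity(x):
    """C(x) = log2(1 + x)."""
    return math.log2(1.0 + x)


def mac_rates(p1, p2, s1, s2, t):
    """Split the multiple access sum rate C_r between the two users.

    t is the fraction of the slot in which user 2 is decoded first
    (user 1 treated as noise); in the remaining 1 - t the order is swapped.
    Returns (C_12r, C_21r, C_r).
    """
    if not 0.0 <= t <= 1.0:
        raise ValueError("t must lie in [0, 1]")
    c1r = capacity(p1 * s1)            # user 1 alone, no interference
    c2r = capacity(p2 * s2)            # user 2 alone, no interference
    cr = capacity(p1 * s1 + p2 * s2)   # multiple access sum rate
    # C_r - C_2r equals log2(1 + P1*S1 / (1 + P2*S2)): user 1 decoded first.
    c12r = t * c1r + (1.0 - t) * (cr - c2r)
    # C_r - C_1r equals log2(1 + P2*S2 / (1 + P1*S1)): user 2 decoded first.
    c21r = (1.0 - t) * c2r + t * (cr - c1r)
    return c12r, c21r, cr
```

For any $t(i)$, the two returned rates add up to $C_r(i)$; $t(i) = 1$ gives user 1 its interference-free rate $C_{1r}(i)$, and $t(i) = 0$ does the same for user 2.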

$\mathcal{M}_4$: The relay transmits the information received from user 2 to user 1. Specifically, the relay extracts the information from buffer $B_2$, encodes it into a codeword, and transmits it to user 1. Therefore, the transmission rate from the relay to user 1 in the $i$-th time slot is limited by both the capacity of the relay-to-user 1 channel and the amount of information stored in buffer $B_2$. Thus, the maximum transmission rate from the relay to user 1 is given by $R_{r1}(i) = \min\{C_{r1}(i), Q_2(i-1)\}$, where $C_{r1}(i) = C(P_r S_1(i))$. Therefore, the amount of information in buffer $B_2$ decreases to $Q_2(i) = Q_2(i-1) - R_{r1}(i)$.

$\mathcal{M}_5$: This mode is identical to $\mathcal{M}_4$ with users 1 and 2 switching places. The maximum transmission rate from the relay to user 2 is given by $R_{r2}(i) = \min\{C_{r2}(i), Q_1(i-1)\}$, where $C_{r2}(i) = C(P_r S_2(i))$, and the amount of information in buffer $B_1$ decreases to $Q_1(i) = Q_1(i-1) - R_{r2}(i)$.

$\mathcal{M}_6$: The relay broadcasts to both user 1 and user 2 the information received from user 2 and user 1, respectively. Specifically, the relay extracts the information intended for user 2 from buffer $B_1$ and the information intended for user 1 from buffer $B_2$. Then, based on the scheme in [4], it constructs a superimposed codeword which contains the information from both users and broadcasts it to both users. Thus, in the $i$-th time slot, the maximum rates from the relay to users 1 and 2 are given by $R_{r1}(i) = \min\{C_{r1}(i), Q_2(i-1)\}$ and $R_{r2}(i) = \min\{C_{r2}(i), Q_1(i-1)\}$, respectively. Therefore, the amounts of information in buffers $B_1$ and $B_2$ decrease to $Q_1(i) = Q_1(i-1) - R_{r2}(i)$ and $Q_2(i) = Q_2(i-1) - R_{r1}(i)$, respectively.
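The buffer dynamics of the six modes can be summarized in a few lines of code. The following is a sketch only: the function name `apply_mode` and the dictionary keys used for the slot capacities are hypothetical, chosen to mirror the notation of this section.

```python
def apply_mode(mode, q1, q2, cap):
    """Advance the relay buffers by one time slot.

    mode: selected transmission mode, an integer from 1 to 6
    q1, q2: buffer contents Q_1(i-1) and Q_2(i-1) in bits/symbol
    cap: dict with the slot capacities C1r, C2r, C12r, C21r, Cr1, Cr2
    Returns (Q_1(i), Q_2(i), R_r1(i), R_r2(i)).
    """
    r_r1 = 0.0  # rate delivered to user 1 in this slot
    r_r2 = 0.0  # rate delivered to user 2 in this slot
    if mode == 1:        # user 1 -> relay
        q1 += cap["C1r"]
    elif mode == 2:      # user 2 -> relay
        q2 += cap["C2r"]
    elif mode == 3:      # multiple access: both users -> relay
        q1 += cap["C12r"]
        q2 += cap["C21r"]
    elif mode in (4, 5, 6):
        if mode in (4, 6):   # relay -> user 1 drains buffer B_2
            r_r1 = min(cap["Cr1"], q2)
        if mode in (5, 6):   # relay -> user 2 drains buffer B_1
            r_r2 = min(cap["Cr2"], q1)
        q2 -= r_r1
        q1 -= r_r2
    else:
        raise ValueError("mode must be an integer between 1 and 6")
    return q1, q2, r_r1, r_r2
```

The `min` operations reflect that the relay cannot forward more information than its buffers hold, which is why the buffer state enters the achievable relay-to-user rates.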

Our aim is to develop an optimal selection policy which selects one of the six possible transmission modes in each time slot such that, given $P_1$, $P_2$, and $P_r$, the average sum rate of both users is maximized. To this end, we introduce six binary variables $q_k(i) \in \{0, 1\}$, $k = 1, \ldots, 6$, where $q_k(i)$ indicates whether or not transmission mode $\mathcal{M}_k$ is selected in the $i$-th time slot. In particular, $q_k(i) = 1$ if mode $\mathcal{M}_k$ is selected and $q_k(i) = 0$ if it is not selected in the $i$-th time slot. Moreover, since in each time slot only one of the six transmission modes can be selected, only one of the mode selection variables is equal to one and the others are zero, i.e., $\sum_{k=1}^{6} q_k(i) = 1$ holds.

In the proposed framework, we assume that one node (e.g. the relay node) is responsible for deciding which 
transmission mode is selected based on the full CSI of both links and the proposed protocol, cf. Theorem 2. Then,
it sends its decision to the other nodes and transmission in time slot i begins. Furthermore, for each node to be 
able to decode their intended messages and adapt their transmission rates, all three nodes have to have full CSI of 
both links. 

III. Adaptive Mode Selection 

In this section, we first investigate the achievable average sum rate of the network. Then, we formulate a 
maximization problem whose solution yields the maximum average sum rate. 

A. Achievable Average Sum Rate 

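We assume that user 1 and user 2 always have enough information to send in all time slots and that the number of time slots, $N$, satisfies $N \to \infty$. Therefore, using $q_k(i)$, the user 1-to-relay, user 2-to-relay, relay-to-user 1, and relay-to-user 2 average transmission rates, denoted by $\bar{R}_{1r}$, $\bar{R}_{2r}$, $\bar{R}_{r1}$, and $\bar{R}_{r2}$, respectively, are obtained as

$$\bar{R}_{1r} = \lim_{N \to \infty} \frac{1}{N} \sum_{i=1}^{N} \left[ q_1(i) C_{1r}(i) + q_3(i) C_{12r}(i) \right] \tag{2a}$$
$$\bar{R}_{2r} = \lim_{N \to \infty} \frac{1}{N} \sum_{i=1}^{N} \left[ q_2(i) C_{2r}(i) + q_3(i) C_{21r}(i) \right] \tag{2b}$$
$$\bar{R}_{r1} = \lim_{N \to \infty} \frac{1}{N} \sum_{i=1}^{N} \left[ q_4(i) + q_6(i) \right] \min\{C_{r1}(i), Q_2(i-1)\} \tag{2c}$$
$$\bar{R}_{r2} = \lim_{N \to \infty} \frac{1}{N} \sum_{i=1}^{N} \left[ q_5(i) + q_6(i) \right] \min\{C_{r2}(i), Q_1(i-1)\}. \tag{2d}$$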



The average rate from user 1 to user 2 is the average rate that user 2 receives from the relay, i.e., $\bar{R}_{r2}$. Similarly, the average rate from user 2 to user 1 is the average rate that user 1 receives from the relay, i.e., $\bar{R}_{r1}$. In the following theorem, we introduce a useful condition for the queues in the buffers of the relay leading to the optimal mode selection policy.

May 1, 2013 DRAFT

Theorem 1 (Optimal Queue Condition): The maximum sum rate, $\bar{R}_{r1} + \bar{R}_{r2}$, for the considered bidirectional relay network is obtained when the queues in buffers $B_1$ and $B_2$ are at the edge of non-absorption. More precisely, the following conditions must hold for the maximum sum rate:

$$\bar{R}_{1r} = \bar{R}_{r2} = \lim_{N \to \infty} \frac{1}{N} \sum_{i=1}^{N} \left[ q_5(i) + q_6(i) \right] C_{r2}(i) \tag{3a}$$
$$\bar{R}_{2r} = \bar{R}_{r1} = \lim_{N \to \infty} \frac{1}{N} \sum_{i=1}^{N} \left[ q_4(i) + q_6(i) \right] C_{r1}(i) \tag{3b}$$

where $\bar{R}_{1r}$ and $\bar{R}_{2r}$ are given by (2a) and (2b), respectively.
Proof: Please refer to Appendix A.

Using this theorem, we now derive the optimal transmission mode selection policy. 

B. Optimal Mode Selection Protocol 

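The available degrees of freedom in the considered network are the mode selection variables $q_k(i)$, $\forall k, i$, and the time sharing variable in the multiple access mode, $t(i)$, $\forall i$. In the following, we formulate a problem for optimization of $q_k(i)$, $\forall k, i$, and $t(i)$, $\forall i$, such that the average sum rate, $\bar{R}_{r1} + \bar{R}_{r2}$, is maximized:

$$\begin{aligned}
\underset{q_k(i),\, t(i),\ \forall i, k}{\text{maximize}} \quad & \bar{R}_{1r} + \bar{R}_{2r} \\
\text{subject to} \quad & \mathrm{C1}: \ \bar{R}_{1r} = \bar{R}_{r2} \\
& \mathrm{C2}: \ \bar{R}_{2r} = \bar{R}_{r1} \\
& \mathrm{C3}: \ \textstyle\sum_{k=1}^{6} q_k(i) = 1, \ \forall i \\
& \mathrm{C4}: \ q_k(i)[1 - q_k(i)] = 0, \ \forall i, k \\
& \mathrm{C5}: \ 0 \leq t(i) \leq 1, \ \forall i
\end{aligned} \tag{4}$$

where C1 and C2 are due to Theorem 1, C3 and C4 guarantee that only one of the transmission modes is selected in each time slot, and C5 specifies the interval of the time sharing variable. Considering constraints C1 and C2, the sum rate, $\bar{R}_{r1} + \bar{R}_{r2}$, is also given by $\bar{R}_{1r} + \bar{R}_{2r}$.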

As will be seen later, the solution of (4), i.e., the optimal mode selection policy, may require coin flips. Therefore, we define $X_n(i) \in \{0, 1\}$ as the outcome of the $n$-th coin flip in the $i$-th time slot. The probabilities of the possible outcomes of the $n$-th coin flip are $\Pr\{X_n(i) = 1\} = p_n$ and $\Pr\{X_n(i) = 0\} = 1 - p_n$. In the following theorem, we provide the solution to the maximization problem in (4). To this end, we define three mutually exclusive signal-to-noise ratio (SNR) regions and each region requires a different optimal selection policy. For given $\Omega_1$, $\Omega_2$, $P_1$, $P_2$, and $P_r$, exactly one of the SNR regions is applicable.
Theorem 2 (Mode Selection Policy): The optimal mode selection policy which maximizes the average sum rate of the considered three-node half-duplex bidirectional relay network with AWGN and block fading is given by

$$q_{k^*}(i) = \begin{cases} 1, & k^* = \underset{k = 1, \ldots, 6}{\arg\max} \{ I_k(i) \Lambda_k(i) \} \\ 0, & \text{otherwise} \end{cases} \tag{5}$$

where $\Lambda_k(i)$ is referred to as the selection metric, given by

$$\Lambda_1(i) = (1 - \mu_1) C_{1r}(i) \tag{6a}$$
$$\Lambda_2(i) = (1 - \mu_2) C_{2r}(i) \tag{6b}$$
$$\Lambda_3(i) = (1 - \mu_1) C_{12r}(i) + (1 - \mu_2) C_{21r}(i) \tag{6c}$$
$$\Lambda_4(i) = \mu_2 C_{r1}(i) \tag{6d}$$
$$\Lambda_5(i) = \mu_1 C_{r2}(i) \tag{6e}$$
$$\Lambda_6(i) = \mu_1 C_{r2}(i) + \mu_2 C_{r1}(i) \tag{6f}$$

and $I_k(i) \in \{0, 1\}$ is a binary indicator variable which determines whether $\mathcal{M}_k$ is a possible candidate for selection in the $i$-th time slot, i.e., mode $\mathcal{M}_k$ cannot be selected if $I_k(i) = 0$, and $\mu_1$ and $\mu_2$ are decision thresholds. For the values of $I_k(i)$, the optimal values of $\mu_1$ and $\mu_2$ in (6), and the optimal value of $t(i)$ in $C_{12r}(i)$ and $C_{21r}(i)$ in (6c), we distinguish three mutually exclusive SNR regions:

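SNR Region $\mathcal{R}_1$: SNR region $\mathcal{R}_1$ is characterized by $\{P_2 > P_r \text{ AND } \omega_1 < 1\}$ OR $\{P_2 < P_r \text{ AND } \omega_2 < 1\}$ OR $\{P_2 = P_r \text{ AND } \omega_3^l < \omega_3^u\}$ with

$$\omega_1 = \frac{E\{q C_{r2}\}}{E\{(1-q)[C_r - C_{2r}]\}}, \quad \omega_2 = \frac{E\{q C_{2r}\}}{E\{(1-q) C_{r1}\}}, \quad \omega_3^l = \frac{E\{C_{r2}\}}{E\{C_r\}}, \quad \text{and} \quad \omega_3^u = \frac{E\{C_{r1}\}}{E\{C_{r1} + C_{2r}\}},$$

where $q(i)$ is defined as

$$q(i) = \begin{cases} 1, & \text{if } \left\{P_2 > P_r \text{ AND } [\Lambda_3(i) < \Lambda_6(i)]_{\mu_1 = 1}\right\} \text{ OR } \left\{P_2 < P_r \text{ AND } [\Lambda_3(i) > \Lambda_6(i)]_{\mu_2 = 0}\right\} \\ 0, & \text{otherwise.} \end{cases} \tag{7}$$

In (7), if $P_2 > P_r$, $\mu_2$ is chosen such that $E\{(1-q) C_{2r}\} = E\{q C_{r1}\}$ holds, whereas if $P_2 < P_r$, $\mu_1$ is chosen such that $E\{q [C_r - C_{2r}]\} = E\{(1-q) C_{r2}\}$ holds.

For this SNR region, $t(i) = 0$, $\forall i$, and $I_k(i) = 0$ for $k = 1, 4$, $\forall i$, whereas $I_k(i)$ for $k = 2, 3, 5, 6$ and the thresholds $\mu_1$ and $\mu_2$ are as follows:

If $P_2 > P_r$, then $\mu_1^* = 1$, $\mu_2^* = \mu_2$ and $I_5(i) = 0$, $I_3(i) = 1 - I_2(i) = X_1(i)$, $I_6(i) = 1$.

If $P_2 < P_r$, then $\mu_1^* = \mu_1$, $\mu_2^* = 0$ and $I_2(i) = 0$, $I_3(i) = 1$, $I_6(i) = 1 - I_5(i) = X_2(i)$.

If $P_2 = P_r$, then $\mu_1^* = 1$, $\mu_2^* = 0$ and $I_2(i) = X_3(i)[1 - X_4(i)]$, $I_3(i) = X_3(i) X_4(i)$, $I_5(i) = [1 - X_3(i)][1 - X_5(i)]$, $I_6(i) = [1 - X_3(i)] X_5(i)$,

where the coin flip probabilities are $p_1 = \omega_1$, $p_2 = \omega_2$, $p_4 = \frac{(1 - p_3)\,\omega_3^l}{p_3 (1 - \omega_3^l)}$, $p_5 = \frac{p_3 (1 - \omega_3^u)}{(1 - p_3)\,\omega_3^u}$, and for $p_3$, we have the freedom to choose any value in the interval $[\omega_3^l, \omega_3^u]$.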



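SNR Region $\mathcal{R}_2$: SNR region $\mathcal{R}_2$ is characterized by $\{P_1 > P_r \text{ AND } \omega_1 < 1\}$ OR $\{P_1 < P_r \text{ AND } \omega_2 < 1\}$ OR $\{P_1 = P_r \text{ AND } \omega_3^l < \omega_3^u\}$ with

$$\omega_1 = \frac{E\{q C_{r1}\}}{E\{(1-q)[C_r - C_{1r}]\}}, \quad \omega_2 = \frac{E\{q C_{1r}\}}{E\{(1-q) C_{r2}\}}, \quad \omega_3^l = \frac{E\{C_{r1}\}}{E\{C_r\}}, \quad \text{and} \quad \omega_3^u = \frac{E\{C_{r2}\}}{E\{C_{r2} + C_{1r}\}},$$

where $q(i)$ is defined as

$$q(i) = \begin{cases} 1, & \text{if } \left\{P_1 > P_r \text{ AND } [\Lambda_3(i) < \Lambda_6(i)]_{\mu_2 = 1}\right\} \text{ OR } \left\{P_1 < P_r \text{ AND } [\Lambda_3(i) > \Lambda_6(i)]_{\mu_1 = 0}\right\} \\ 0, & \text{otherwise.} \end{cases} \tag{8}$$

In (8), if $P_1 > P_r$, the threshold $\mu_1$ is chosen such that $E\{(1-q) C_{1r}\} = E\{q C_{r2}\}$ holds, whereas if $P_1 < P_r$, the threshold $\mu_2$ is chosen such that $E\{q [C_r - C_{1r}]\} = E\{(1-q) C_{r1}\}$ holds.

For this SNR region, $t(i) = 1$, $\forall i$, and $I_k(i) = 0$ for $k = 2, 5$, $\forall i$, whereas $I_k(i)$ for $k = 1, 3, 4, 6$ and the thresholds $\mu_1$ and $\mu_2$ are as follows:

If $P_1 > P_r$, then $\mu_1^* = \mu_1$, $\mu_2^* = 1$ and $I_4(i) = 0$, $I_3(i) = 1 - I_1(i) = X_1(i)$, $I_6(i) = 1$.

If $P_1 < P_r$, then $\mu_1^* = 0$, $\mu_2^* = \mu_2$ and $I_1(i) = 0$, $I_3(i) = 1$, $I_6(i) = 1 - I_4(i) = X_2(i)$.

If $P_1 = P_r$, then $\mu_1^* = 0$, $\mu_2^* = 1$ and $I_1(i) = X_3(i)[1 - X_4(i)]$, $I_3(i) = X_3(i) X_4(i)$, $I_4(i) = [1 - X_3(i)][1 - X_5(i)]$, $I_6(i) = [1 - X_3(i)] X_5(i)$,

where the coin flip probabilities are $p_1 = \omega_1$, $p_2 = \omega_2$, $p_4 = \frac{(1 - p_3)\,\omega_3^l}{p_3 (1 - \omega_3^l)}$, $p_5 = \frac{p_3 (1 - \omega_3^u)}{(1 - p_3)\,\omega_3^u}$, and for $p_3$, we have the freedom to choose any value in the interval $[\omega_3^l, \omega_3^u]$.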

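SNR Region $\mathcal{R}_0$: The SNR region is $\mathcal{R}_0$ if and only if it is not $\mathcal{R}_1$ or $\mathcal{R}_2$. For SNR region $\mathcal{R}_0$, $I_k(i) = 0$ for $k = 1, 2, 4, 5$ and $I_k(i) = 1$ for $k = 3, 6$, $\forall i$, and $t(i)$ is given by

$$t(i) = \begin{cases} 0, & \text{if } t < 0 \\ t, & \text{if } 0 \leq t \leq 1 \\ 1, & \text{if } t > 1 \end{cases} \tag{9}$$

where $t = \frac{E\{(1-q) C_{r2}\} - E\{q [C_r - C_{2r}]\}}{E\{q [C_{1r} + C_{2r} - C_r]\}}$ and $q(i)$ is given by

$$q(i) = \begin{cases} 1, & \text{if } \frac{C_r(i)}{C_{r1}(i) + C_{r2}(i)} \geq \frac{\mu^*}{1 - \mu^*} \\ 0, & \text{otherwise} \end{cases} \tag{10}$$

where $\mu^*$ is chosen such that $E\{q C_r\} = E\{(1-q)[C_{r1} + C_{r2}]\}$ holds. Moreover, if $0 \leq t \leq 1$, we obtain $\mu_1 = \mu_2 = \mu^*$. Otherwise, if $t < 0$ or $t > 1$, we obtain $\mu_1 > \mu_2$ and $\mu_1 < \mu_2$, respectively, and $\mu_1$ and $\mu_2$ are chosen such that C1 and C2 in (4) hold.

Proof: Please refer to Appendix B.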
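The per-slot decision in (5) and (6) can be sketched as follows. This is an illustration, not the authors' implementation: the names `selection_metrics` and `select_mode`, and the dictionary keys for the slot capacities, are hypothetical. The sketch also restricts the arg max to modes with indicator equal to one, which is one reading of the statement that a mode cannot be selected when its indicator is zero.

```python
def selection_metrics(cap, mu1, mu2):
    """Selection metrics Lambda_1..Lambda_6 of (6a)-(6f) for one time slot.

    cap: dict with the slot capacities C1r, C2r, C12r, C21r, Cr1, Cr2
    mu1, mu2: decision thresholds
    """
    return [
        (1.0 - mu1) * cap["C1r"],
        (1.0 - mu2) * cap["C2r"],
        (1.0 - mu1) * cap["C12r"] + (1.0 - mu2) * cap["C21r"],
        mu2 * cap["Cr1"],
        mu1 * cap["Cr2"],
        mu1 * cap["Cr2"] + mu2 * cap["Cr1"],
    ]


def select_mode(cap, mu1, mu2, indicators):
    """Return the index k* (1..6) of the mode selected in this slot.

    indicators: list of six 0/1 values I_1..I_6; modes with indicator 0
    are excluded from the maximization.
    """
    metrics = selection_metrics(cap, mu1, mu2)
    candidates = [k for k in range(6) if indicators[k] == 1]
    if not candidates:
        raise ValueError("at least one mode must be a candidate")
    best = max(candidates, key=lambda k: metrics[k])
    return best + 1  # modes are numbered from 1
```

For example, in SNR region $\mathcal{R}_0$ only the multiple access and broadcast modes are candidates, so `indicators = [0, 0, 1, 0, 0, 1]` and the decision reduces to comparing $\Lambda_3(i)$ with $\Lambda_6(i)$.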

We note that in the proposed optimal policy, regardless of the SNR region, the multiple access and broadcast modes are always selected with non-zero probability. Only in the extreme case, when the average SNR of one of the links is much larger than the average SNR of the other link, some of the point-to-point modes are also selected in the optimal policy. Also, the mode selection metric $\Lambda_k(i)$ includes two parts. The first part is the capacity of mode $\mathcal{M}_k$, and the second part consists of the decision thresholds $\mu_1$ and/or $\mu_2$. The capacity is a function of the instantaneous CSI, but the decision thresholds $\mu_1$ and $\mu_2$ depend only on the statistics of the channels. Hence, the decision thresholds, $\mu_1$ and $\mu_2$, can be obtained offline and used as long as the channel statistics remain unchanged.

Remark: To find the optimal values of $\mu_1$ and $\mu_2$, we need one/two-dimensional searches over $[0, 1]$. Specifically, in SNR regions $\mathcal{R}_1$ and $\mathcal{R}_2$, at least one of the thresholds $\mu_1$ and $\mu_2$ is given. Therefore, at most one one-dimensional search is required. In SNR region $\mathcal{R}_0$, if $0 \leq t \leq 1$, we obtain $\mu_1 = \mu_2 = \mu^*$, therefore, a one-dimensional search is required to determine $\mu^*$. However, if $t < 0$ or $t > 1$, we need a two-dimensional search to find $\mu_1$ and $\mu_2$.
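A one-dimensional search of this kind can be sketched with bisection. The example below targets the case of SNR region $\mathcal{R}_0$ with $\mu_1 = \mu_2 = \mu^*$ and replaces the expectations by sample averages over Rayleigh-faded channel draws. Both the Monte Carlo approximation and the use of bisection are assumptions of this sketch (the paper only states that a search over $[0, 1]$ is needed), and the function names are hypothetical. Bisection applies because the balance $E\{q C_r\} - E\{(1-q)[C_{r1} + C_{r2}]\}$ is non-increasing in the threshold: raising it can only switch slots from the multiple access mode to the broadcast mode.

```python
import math
import random


def capacity(x):
    """C(x) = log2(1 + x)."""
    return math.log2(1.0 + x)


def draw_samples(n, p1, p2, pr, omega1, omega2, seed=0):
    """Draw n tuples (C_r, C_r1, C_r2), assuming Rayleigh fading (linear-scale inputs)."""
    rng = random.Random(seed)
    samples = []
    for _ in range(n):
        s1 = rng.expovariate(1.0 / omega1)
        s2 = rng.expovariate(1.0 / omega2)
        samples.append((capacity(p1 * s1 + p2 * s2),
                        capacity(pr * s1),
                        capacity(pr * s2)))
    return samples


def balance(mu, samples):
    """Sample average of q*C_r - (1 - q)*(C_r1 + C_r2) for threshold mu."""
    total = 0.0
    for cr, cr1, cr2 in samples:
        # q = 1 when the multiple access metric is at least the broadcast metric.
        if (1.0 - mu) * cr >= mu * (cr1 + cr2):
            total += cr
        else:
            total -= cr1 + cr2
    return total / len(samples)


def find_mu_star(samples, iterations=50):
    """Bisection on [0, 1] for the threshold at which the balance changes sign."""
    lo, hi = 0.0, 1.0  # balance(lo) > 0 and balance(hi) < 0 for positive capacities
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if balance(mid, samples) > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)
```

Because the thresholds depend only on the channel statistics, such a search would be run offline once per set of statistics, in line with the observation above.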

IV. Simulation Results 

In this section, we evaluate the maximum average sum rate in the considered bidirectional relay network for Rayleigh fading. Thus, channel gains $S_1(i)$ and $S_2(i)$ follow exponential distributions with averages $\Omega_1$ and $\Omega_2$, respectively. Furthermore, the transmit powers of user 1 and user 2 are identical and set to $P_1 = P_2 = 10$ dB, and $\Omega_2 = 0$ dB. The number of time slots used in the simulations is $N$ = 10^.

Fig. 3. Maximum achievable sum rate versus $\Omega_1$.



Fig. 3 illustrates the maximum achievable sum rate versus $\Omega_1$. In order to simulate all possible cases introduced in Theorem 2, we consider $P_r = 5, 10, 15$ dB, which yields the solid lines in Fig. 3. As the value of $\Omega_1$ varies from $-10$ dB to $10$ dB in Fig. 3, the SNR region changes from $\mathcal{R}_2$ to $\mathcal{R}_0$ and then from $\mathcal{R}_0$ to $\mathcal{R}_1$. Since in SNR region $\mathcal{R}_2$ the bottleneck is the relay-to-user 1 link, reducing $\Omega_1$ reduces the sum rate. On the other hand, in SNR region $\mathcal{R}_1$, the bottleneck link is the relay-to-user 2 link, and therefore, for large $\Omega_1$, the maximum average sum rate saturates.

As performance benchmarks, we consider the traditional two-way, the TDBC [2], the MABC [3], and the buffer-aided protocol in [8]. Since in [8], the protocol is derived for $P_1 = P_2 = P_r$, in Fig. 3, we only show the sum rates of the benchmark schemes for $P_r = 10$ dB. From the comparison in Fig. 3, we observe that a considerable gain is obtained by adaptive mode selection compared to the traditional two-way, TDBC, and MABC protocols. Furthermore, a small gain is obtained compared to the buffer-aided protocol in [8] which selects only the user 1-to-relay, user 2-to-relay, and broadcast modes adaptively. Finally, we note that our protocol is derived for given $\Omega_1$, $\Omega_2$, $P_1$, $P_2$, and $P_r$, and without a constraint on the total average power consumed by all nodes. The problem with a total average power constraint would lead to a different solution.

V. Conclusion 

We have derived the optimal transmission mode selection policy for sum rate maximization of the three-node 
half-duplex bidirectional buffer-aided relay network with fading links and fixed transmit powers at the nodes. The 
proposed selection policy determines the optimal transmission mode based on the instantaneous CSI of the involved 
links in each time slot and their long-term statistics. Simulation results confirmed that the proposed selection policy 
outperforms the existing protocols from the literature in terms of average sum rate. 
Appendix A

Proof of Theorem 1 (Optimal Queue Condition) 

Let $\bar{C}_{r1}$ and $\bar{C}_{r2}$ denote the average rates achieved by the relay assuming that buffers $B_1$ and $B_2$ always have enough information to supply. Then, $\bar{C}_{r1}$ and $\bar{C}_{r2}$ are given by

$$\bar{C}_{r1} = \lim_{N \to \infty} \frac{1}{N} \sum_{i=1}^{N} \left[ q_4(i) + q_6(i) \right] C_{r1}(i) \tag{11a}$$
$$\bar{C}_{r2} = \lim_{N \to \infty} \frac{1}{N} \sum_{i=1}^{N} \left[ q_5(i) + q_6(i) \right] C_{r2}(i). \tag{11b}$$

If we assume that $\bar{R}_{2r} \geq \bar{C}_{r1}$ holds, i.e., the average rate flowing into the buffer is larger than or equal to the average capacity that can flow out of the buffer, then the buffer always has enough information to supply and $\bar{R}_{r1}$ in (2c) becomes $\bar{R}_{r1} = \bar{C}_{r1}$, where $\bar{C}_{r1}$ is given in (11a). For a proof of this property, we refer to the proof of Theorem 1 in [9, Appendix A]. On the other hand, if we assume that $\bar{R}_{2r} < \bar{C}_{r1}$ holds, i.e., the average information flowing into the buffer is less than the average capacity that can flow out of the buffer, then by the law of conservation of flow, we obtain $\bar{R}_{r1} = \bar{R}_{2r}$. Similar results hold for $\bar{R}_{r2}$, $\bar{R}_{1r}$, and $\bar{C}_{r2}$. Hence, $\bar{R}_{r1}$ and $\bar{R}_{r2}$ can be written as

$$\bar{R}_{r1} = \begin{cases} \bar{C}_{r1}, & \text{if } \bar{R}_{2r} \geq \bar{C}_{r1} \\ \bar{R}_{2r}, & \text{otherwise} \end{cases} \tag{12a}$$
$$\bar{R}_{r2} = \begin{cases} \bar{C}_{r2}, & \text{if } \bar{R}_{1r} \geq \bar{C}_{r2} \\ \bar{R}_{1r}, & \text{otherwise.} \end{cases} \tag{12b}$$

On the other hand, (12) can also be written as

$$\bar{R}_{r1} = \min\{\bar{R}_{2r}, \bar{C}_{r1}\} \tag{13a}$$
$$\bar{R}_{r2} = \min\{\bar{R}_{1r}, \bar{C}_{r2}\} \tag{13b}$$

i.e., the average rate flowing out of buffer $B_1$ ($B_2$) is limited by the minimum of the average rate flowing into the buffer and the capacity rate that can flow out of the buffer. We will show that, in the optimal selection policy, $\bar{R}_{1r} = \bar{C}_{r2}$ and $\bar{R}_{2r} = \bar{C}_{r1}$ must hold, and as a result from (13a) and (13b), we obtain $\bar{R}_{1r} = \bar{R}_{r2} = \bar{C}_{r2}$ and $\bar{R}_{2r} = \bar{R}_{r1} = \bar{C}_{r1}$. To show this, we denote by $\mathcal{I}_k$ the set comprising the time slots $i$ in which $q_k(i) = 1$, i.e., the elements in $\mathcal{I}_k$ represent the time slots in which transmission mode $\mathcal{M}_k$ is selected in the optimal solution. Note that $\sum_{k=1}^{6} |\mathcal{I}_k| = N$, where $|\cdot|$ denotes the cardinality of a set and $\emptyset$ denotes an empty set with zero cardinality. We will show that, except for the case when both arguments of the min functions in (13a) and (13b) are equal, all other cases result in a suboptimal sum rate or can be safely replaced with the case when both arguments of the min function are equal without reducing the sum rate. For the two min functions, we have to consider the following nine cases:
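Case 1: If $\bar{R}_{1r} > \bar{C}_{r2}$ and $\bar{R}_{2r} > \bar{C}_{r1}$, then we can move any indices $i$ from $\mathcal{I}_1$, $\mathcal{I}_2$, or $\mathcal{I}_3$ to $\mathcal{I}_6$ and thereby increase both $\bar{C}_{r1}$ and $\bar{C}_{r2}$, and thus improve the sum rate. Hence, $\bar{R}_{1r} > \bar{C}_{r2}$ and $\bar{R}_{2r} > \bar{C}_{r1}$ cannot hold since the resulting sum rate is suboptimal.

Case 2: If $\bar{R}_{1r} = \bar{C}_{r2}$ and $\bar{R}_{2r} > \bar{C}_{r1}$, then if $\mathcal{I}_2 \neq \emptyset$, one can move some indices from $\mathcal{I}_2$ to $\mathcal{I}_4$ which maintains $\bar{R}_{1r} = \bar{C}_{r2}$ and increases $\bar{C}_{r1}$, and consequently increases the sum rate, thereby contradicting optimality. On the other hand, if $\mathcal{I}_2 = \emptyset$, we have $\mathcal{I}_3 \neq \emptyset$ since otherwise, $\bar{R}_{2r} = 0$ and because we have assumed $\bar{R}_{2r} > \bar{C}_{r1}$, we obtain $\bar{R}_{2r} = \bar{C}_{r1} = 0$. Hence, assume $\mathcal{I}_2 = \emptyset$ and $\mathcal{I}_3 \neq \emptyset$, then there are two possibilities. The first is when $t(i) = 1$, $\forall i$ for which $q_3(i) = 1$. Then, since $C_{12r}(i)$ becomes $C_{12r}(i) = C_{1r}(i)$, we can move some indices from $\mathcal{I}_3$ to $\mathcal{I}_1$, which maintains $\bar{R}_{1r} = \bar{C}_{r2}$ and decreases $\bar{R}_{2r}$ until we obtain $\bar{R}_{2r} = \bar{C}_{r1}$, thereby contradicting $\bar{R}_{2r} > \bar{C}_{r1}$. However, if $t(i) \neq 1$ for some time slots $i$ for which $q_3(i) = 1$, then since $C_{12r}(i) < C_{1r}(i)$ holds, we can move some indices from $\mathcal{I}_3$ to $\mathcal{I}_1$ and $\mathcal{I}_4$ such that $\bar{R}_{1r} = \bar{C}_{r2}$ is maintained and $\bar{C}_{r1}$ is increased and thereby the sum rate is increased, which results in a contradiction of the optimal sum rate assumption.

Case 3: If $\bar{R}_{1r} = \bar{C}_{r2}$ and $\bar{R}_{2r} < \bar{C}_{r1}$, then if $\mathcal{I}_4 \neq \emptyset$, we can move some indices of $\mathcal{I}_4$ to $\mathcal{I}_2$ which maintains $\bar{R}_{1r} = \bar{C}_{r2}$ and increases $\bar{R}_{2r}$ and consequently increases the sum rate and thus contradicts optimality. On the other hand, if $\mathcal{I}_4 = \emptyset$, we must have $\mathcal{I}_6 \neq \emptyset$, otherwise, $\bar{C}_{r1} = 0$, and since we have assumed $\bar{R}_{2r} < \bar{C}_{r1}$, we obtain $\bar{R}_{2r} = \bar{C}_{r1} = 0$. However, if $\mathcal{I}_6 \neq \emptyset$, we can move some of the indices in $\mathcal{I}_6$ to $\mathcal{I}_5$ which maintains $\bar{R}_{1r} = \bar{C}_{r2}$ and decreases $\bar{C}_{r1}$ until we obtain $\bar{R}_{2r} = \bar{C}_{r1}$, thereby contradicting $\bar{R}_{2r} < \bar{C}_{r1}$.

Case 4: If $\bar{R}_{1r} > \bar{C}_{r2}$ and $\bar{R}_{2r} < \bar{C}_{r1}$, then if $\mathcal{I}_4 \neq \emptyset$, we can move some indices of $\mathcal{I}_4$ to $\mathcal{I}_5$, which increases $\bar{C}_{r2}$ and consequently the sum rate, thereby contradicting optimality. However, if $\mathcal{I}_4 = \emptyset$, then we must have $\mathcal{I}_6 \neq \emptyset$, otherwise $\bar{C}_{r1} = 0$, leading to $\bar{R}_{2r} = \bar{C}_{r1} = 0$, and the rest of the conclusions are similar to those in Case 2. Then, if $\mathcal{I}_6 \neq \emptyset$, one can move some of the indices from $\mathcal{I}_6$ to $\mathcal{I}_5$ which maintains $\bar{R}_{1r} > \bar{C}_{r2}$ but decreases $\bar{C}_{r1}$ until we obtain $\bar{R}_{2r} = \bar{C}_{r1}$, and the rest of the conclusions are similar to those in Case 2.

The conclusions for the four remaining cases, ($\bar{R}_{1r} < \bar{C}_{r2}$ and $\bar{R}_{2r} < \bar{C}_{r1}$), ($\bar{R}_{1r} > \bar{C}_{r2}$ and $\bar{R}_{2r} = \bar{C}_{r1}$), ($\bar{R}_{1r} < \bar{C}_{r2}$ and $\bar{R}_{2r} = \bar{C}_{r1}$), and ($\bar{R}_{1r} < \bar{C}_{r2}$ and $\bar{R}_{2r} > \bar{C}_{r1}$), are similar to those in Case 1, Case 2, Case 3, and Case 4, respectively. Therefore, the final case remaining for the optimal solution is $\bar{R}_{r2} = \bar{R}_{1r} = \bar{C}_{r2}$ and $\bar{R}_{r1} = \bar{R}_{2r} = \bar{C}_{r1}$. This completes the proof.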
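The flow-conservation property used in this proof, namely that the long-run rate leaving a buffer equals the minimum of the average rate entering it and the average capacity available for draining it, can be checked numerically. The sketch below is a toy model and not the paper's protocol: a single buffer, a random choice between a receive slot and a transmit slot, and exponentially distributed per-slot arrivals and capacities are all assumptions made for illustration, as is the function name `simulate_buffer`.

```python
import random


def simulate_buffer(n_slots, p_receive, mean_in, mean_cap, seed=0):
    """Simulate one relay buffer and return per-slot averages.

    In each slot the relay either receives (probability p_receive) or
    transmits. Returns (average inflow, average outflow capacity,
    average actual outflow), each normalized by the number of slots.
    """
    rng = random.Random(seed)
    queue = 0.0
    total_in = 0.0
    total_cap = 0.0
    total_out = 0.0
    for _ in range(n_slots):
        if rng.random() < p_receive:
            arrival = rng.expovariate(1.0 / mean_in)
            queue += arrival
            total_in += arrival
        else:
            cap = rng.expovariate(1.0 / mean_cap)
            out = min(cap, queue)  # cannot send more than is stored
            queue -= out
            total_cap += cap
            total_out += out
    return total_in / n_slots, total_cap / n_slots, total_out / n_slots
```

When the inflow is smaller than the outflow capacity, the measured outflow tracks the inflow; when it is larger, the buffer fills up and the outflow tracks the capacity. The edge of non-absorption in Theorem 1 is the boundary between these two regimes.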

Appendix B 

Proof of Theorem 2 (Mode Selection Protocol) 

In this appendix, we solve the optimization problem given by (4). We first relax the binary condition for $q_k(i)$, i.e., $q_k(i)[1 - q_k(i)] = 0$, to $0 \leq q_k(i) \leq 1$, and later in Appendix C, we prove that the binary relaxation does not affect the maximum average sum rate. In the following, we investigate the Karush-Kuhn-Tucker (KKT) necessary conditions of the relaxed optimization problem and then show that the necessary conditions result in a unique sum rate and thus the solution is optimal.

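To simplify the usage of the KKT conditions, we formulate a minimization problem equivalent to the relaxed maximization problem in (4) as follows

$$\begin{aligned}
\underset{q_k(i),\, t(i),\ \forall i, k}{\text{minimize}} \quad & -(\bar{R}_{1r} + \bar{R}_{2r}) \\
\text{subject to} \quad & \mathrm{C1}: \ \bar{R}_{1r} - \bar{R}_{r2} = 0 \\
& \mathrm{C2}: \ \bar{R}_{2r} - \bar{R}_{r1} = 0 \\
& \mathrm{C3}: \ \textstyle\sum_{k=1}^{6} q_k(i) - 1 = 0, \ \forall i \\
& \mathrm{C4}: \ q_k(i) - 1 \leq 0, \ \forall i, k \\
& \mathrm{C5}: \ -q_k(i) \leq 0, \ \forall i, k \\
& \mathrm{C6}: \ t(i) - 1 \leq 0, \ \forall i \\
& \mathrm{C7}: \ -t(i) \leq 0, \ \forall i.
\end{aligned} \tag{14}$$

The Lagrangian function for the above optimization problem is provided in (15), where $\mu_1$, $\mu_2$, $\lambda(i)$, $\alpha_k(i)$, $\beta_k(i)$, $\phi_1(i)$, and $\phi_0(i)$ are the Lagrange multipliers corresponding to constraints C1, C2, C3, C4, C5, C6, and C7, respectively:

$$\begin{aligned}
\mathcal{L}\big(q_k(i), t(i), \mu_1, \mu_2, \lambda(i), \alpha_k(i), \beta_k(i), \phi_l(i)\big) = {} & -(\bar{R}_{1r} + \bar{R}_{2r}) + \mu_1 (\bar{R}_{1r} - \bar{R}_{r2}) + \mu_2 (\bar{R}_{2r} - \bar{R}_{r1}) \\
& + \sum_{i=1}^{N} \lambda(i) \left( \sum_{k=1}^{6} q_k(i) - 1 \right) + \sum_{i=1}^{N} \sum_{k=1}^{6} \alpha_k(i) \left( q_k(i) - 1 \right) - \sum_{i=1}^{N} \sum_{k=1}^{6} \beta_k(i) q_k(i) \\
& + \sum_{i=1}^{N} \phi_1(i) \left( t(i) - 1 \right) - \sum_{i=1}^{N} \phi_0(i) t(i), \quad \text{for } \forall i, k, l.
\end{aligned} \tag{15}$$

The KKT conditions include the following: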

1) Stationary condition: The differentiation of the Lagrangian function with respect to the primal variables, $q_k(i)$ and $t(i)$ for $\forall i, k$, is equal to zero for the optimal solution, i.e.,

$$\frac{\partial \mathcal{L}}{\partial q_k(i)} = 0, \quad \text{for } \forall i, k \tag{16a}$$
$$\frac{\partial \mathcal{L}}{\partial t(i)} = 0, \quad \text{for } \forall i. \tag{16b}$$

2) Primal feasibility condition: The optimal solution has to satisfy the constraints of the primal problem in (14).

3) Dual feasibility condition: The Lagrange multipliers for the inequality constraints have to be non-negative, i.e.,

$$\alpha_k(i) \geq 0, \quad \forall i, k \tag{17a}$$
$$\beta_k(i) \geq 0, \quad \forall i, k \tag{17b}$$
$$\phi_l(i) \geq 0, \quad \forall i, l \tag{17c}$$

have to hold.

4) Complementary slackness: If an inequality is inactive, i.e., the optimal solution is in the interior of the corre- 
sponding set, the corresponding Lagrange multipliers are zero. Thus, we obtain 



afc («) (gfe W - 1) - 0, yi,k (18a) 

/3k{i)qkit) = 0, yi,k (18b) 

<j)i{i){t{i) -I) = Q, yi (18c) 

(j)Q{i)t{i) ^ 0, Vi. (18d) 



A common approach to find a set of primal variables, i.e., $q_k(i), t(i)$, $\forall i,k$, and Lagrange multipliers, i.e., $\mu_1, \mu_2, \lambda(i), \alpha_k(i), \beta_k(i), \phi_l(i)$, $\forall i,k,l$, which satisfy the KKT conditions is to start with the complementary slackness conditions and check whether the inequalities are active or not. Combining these results with the primal feasibility and dual feasibility conditions, we obtain various possibilities. Then, from these possibilities, we obtain one or more candidate solutions from the stationary conditions, and the optimal solution is one of these candidates. In the following subsections, we first pursue this procedure in order to find the optimal $q_k^*(i)$ and $t^*(i)$ for $\forall i,k$, and then we construct the protocol introduced in Theorem 2 based on three exclusive SNR regions.

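A. Optimal $q_k^*(i)$

In order to determine the optimal selection policy, we differentiate the Lagrangian function in (15) with respect to $q_k(i)$, for $k = 1, \dots, 6$, equate the result to zero, and obtain the corresponding $q_k^*(i)$. This leads to

$$ q_1^*(i): \ -\tfrac{1}{N}(1 - \mu_1)\, C_{1r}(i) + \lambda(i) + \alpha_1(i) - \beta_1(i) = 0 \tag{19a} $$

$$ q_2^*(i): \ -\tfrac{1}{N}(1 - \mu_2)\, C_{2r}(i) + \lambda(i) + \alpha_2(i) - \beta_2(i) = 0 \tag{19b} $$

$$ q_3^*(i): \ -\tfrac{1}{N}\big[(1 - \mu_1)\, C_{12r}(i) + (1 - \mu_2)\, C_{21r}(i)\big] + \lambda(i) + \alpha_3(i) - \beta_3(i) = 0 \tag{19c} $$

$$ q_4^*(i): \ -\tfrac{1}{N}\mu_2\, C_{r1}(i) + \lambda(i) + \alpha_4(i) - \beta_4(i) = 0 \tag{19d} $$

$$ q_5^*(i): \ -\tfrac{1}{N}\mu_1\, C_{r2}(i) + \lambda(i) + \alpha_5(i) - \beta_5(i) = 0 \tag{19e} $$

$$ q_6^*(i): \ -\tfrac{1}{N}\big[\mu_1 C_{r2}(i) + \mu_2 C_{r1}(i)\big] + \lambda(i) + \alpha_6(i) - \beta_6(i) = 0. \tag{19f} $$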



Without loss of generality, we first obtain the necessary condition for $q_1^*(i) = 1$ and then generalize the result to $q_k^*(i) = 1$ for $k = 2, \dots, 6$. If $q_1^*(i) = 1$, then from constraint C3 in (14), the other selection variables are zero, i.e., $q_k^*(i) = 0$ for $k = 2,3,4,5,6$. Furthermore, from (18a), we obtain $\alpha_k(i) = 0$ for $k = 2,3,4,5,6$, and from (18b), we obtain $\beta_1(i) = 0$. Then, by substituting these values into (19) and absorbing the common factor $1/N$ into the multipliers, we obtain

$$ \lambda(i) + \alpha_1(i) = (1 - \mu_1)\, C_{1r}(i) \triangleq \Lambda_1(i) \tag{20a} $$

$$ \lambda(i) - \beta_2(i) = (1 - \mu_2)\, C_{2r}(i) \triangleq \Lambda_2(i) \tag{20b} $$

$$ \lambda(i) - \beta_3(i) = (1 - \mu_1)\, C_{12r}(i) + (1 - \mu_2)\, C_{21r}(i) \triangleq \Lambda_3(i) \tag{20c} $$

$$ \lambda(i) - \beta_4(i) = \mu_2\, C_{r1}(i) \triangleq \Lambda_4(i) \tag{20d} $$

$$ \lambda(i) - \beta_5(i) = \mu_1\, C_{r2}(i) \triangleq \Lambda_5(i) \tag{20e} $$

$$ \lambda(i) - \beta_6(i) = \mu_1\, C_{r2}(i) + \mu_2\, C_{r1}(i) \triangleq \Lambda_6(i) \tag{20f} $$

where $\Lambda_k(i)$ is referred to as the selection metric. By subtracting each of the remaining equations in (20) from (20a), we obtain

$$ \Lambda_1(i) - \Lambda_k(i) = \alpha_1(i) + \beta_k(i), \quad k = 2,3,4,5,6. \tag{21} $$

From the dual feasibility conditions given in (17a) and (17b), we have $\alpha_k(i), \beta_k(i) \ge 0$. By inserting $\alpha_k(i), \beta_k(i) \ge 0$ in (21), we obtain the necessary condition for $q_1^*(i) = 1$ as

$$ \Lambda_1(i) \ge \max\{\Lambda_2(i), \Lambda_3(i), \Lambda_4(i), \Lambda_5(i), \Lambda_6(i)\}. \tag{22} $$

Repeating the same procedure for $q_k^*(i) = 1$, $k = 2, \dots, 6$, we obtain the necessary condition for selecting transmission mode $\mathcal{M}_{k^*}$ in the $i$-th time slot as

$$ \Lambda_{k^*}(i) \ge \max_{k \in \{1, \dots, 6\}} \{\Lambda_k(i)\}, \tag{23} $$

where the Lagrange multipliers $\mu_1$ and $\mu_2$ are chosen such that C1 and C2 in (14) hold, and the optimal value of $t(i)$ in $C_{12r}(i)$ and $C_{21r}(i)$ is obtained in the next subsection. We note that $\mu_1$ and $\mu_2$ are long-term thresholds (Lagrange multipliers) which only depend on the statistics of the channels. Moreover, we prove in Appendix D that the optimal values of $\mu_1$ and $\mu_2$ are in the interval $[0, 1]$. For $\mu_1 \ne 0, 1$ and $\mu_2 \ne 0, 1$, the probability that two selection metrics in (20) are equal is zero due to the randomness of the time-continuous channel gains. Therefore, the necessary condition for selecting transmission mode $\mathcal{M}_{k^*}$ in (23) is in fact sufficient and is the optimal selection policy. However, if $\mu_1 = 0, 1$ or $\mu_2 = 0, 1$, then some of the selection metrics in (20) are equal for all time slots. In this case, we have the freedom to choose between the modes which have the same value of the selection metric in (20) as long as the long-term constraints C1 and C2 in (14) hold. One way of selecting between the modes for which the selection metrics $\Lambda_k(i)$ are identical is to use a probabilistic approach via coin flips. Therefore, let $X_n(i) \in \{0, 1\}$ be the outcome of the $n$-th coin flip in the $i$-th time slot with the probabilities of the possible outcomes defined as $\Pr\{X_n(i) = 1\} = p_n$ and $\Pr\{X_n(i) = 0\} = 1 - p_n$. In order to include the coin flip in (23), we write the necessary and sufficient condition for the selection of mode $\mathcal{M}_{k^*}$ in the $i$-th time slot as

$$ \Lambda_{k^*}(i) \ge \max_{k \in \{1, \dots, 6\}} \{I_k(i)\, \Lambda_k(i)\} \tag{24} $$

where $I_k(i) \in \{0, 1\}$ is a binary indicator variable which is equal to zero for the modes that cannot be selected in the $i$-th time slot and one otherwise. As will be seen, the value of $I_k(i)$, depending on the average SNRs, is either deterministic or obtained as a function of the outcome of the coin flip(s).
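As an illustration of the selection rule in (23) and (24), the following Python sketch evaluates the six selection metrics of (20) for one time slot and returns the index of the selected mode. The function names and argument layout are hypothetical. The time-sharing form used for $C_{12r}(i)$ and $C_{21r}(i)$ is an assumption of this example, chosen to agree with (25) and (30a). Modes with $I_k(i) = 0$ are excluded from the maximization rather than multiplied by zero.

```python
def selection_metrics(c1r, c2r, cr, cr1, cr2, mu1, mu2, t):
    """Return [Lambda_1, ..., Lambda_6] of (20) for a single time slot."""
    # Assumed rate split of the multiple access mode with time-sharing factor t.
    c12r = t * c1r + (1.0 - t) * (cr - c2r)
    c21r = (1.0 - t) * c2r + t * (cr - c1r)
    return [
        (1.0 - mu1) * c1r,                        # M1: user 1 to relay
        (1.0 - mu2) * c2r,                        # M2: user 2 to relay
        (1.0 - mu1) * c12r + (1.0 - mu2) * c21r,  # M3: multiple access
        mu2 * cr1,                                # M4: relay to user 1
        mu1 * cr2,                                # M5: relay to user 2
        mu1 * cr2 + mu2 * cr1,                    # M6: broadcast
    ]


def select_mode(metrics, allowed=(1, 1, 1, 1, 1, 1)):
    """Return the 1-based index of the allowed mode with the largest metric."""
    candidates = [k for k in range(6) if allowed[k]]
    best = max(candidates, key=lambda k: metrics[k])
    return best + 1
```

With $\mu_1 = \mu_2 = \mu$, the third metric returned by this sketch reduces to $(1 - \mu) C_r(i)$ for any $t$, in line with (26a).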
B. Optimal $t^*(i)$

To obtain the optimal $t^*(i)$ for the multiple access mode, we assume $q_3^*(i) = 1$ and calculate the derivative in (16b), which leads to

$$ t^*(i): \ -\tfrac{1}{N}(\mu_1 - \mu_2)\big[C_r(i) - C_{1r}(i) - C_{2r}(i)\big] + \phi_1(i) - \phi_0(i) = 0. \tag{25} $$

Now, we investigate the following possible cases for $t^*(i)$:

Case 1: If $0 < t^*(i) < 1$, then from (18c) and (18d), we obtain $\phi_l(i) = 0$, for $l = 0, 1$. From (25) and by considering $C_r(i) - C_{1r}(i) - C_{2r}(i) < 0$, we also obtain $\mu_1 = \mu_2 = \mu$. Combining these results into (23) leads to

$$ \Lambda_3(i) = (1 - \mu)\, C_r(i) \ge \max\{\Lambda_1(i), \Lambda_2(i)\} \tag{26a} $$

$$ \Lambda_6(i) = \mu\, C_{r1}(i) + \mu\, C_{r2}(i) \ge \max\{\Lambda_4(i), \Lambda_5(i)\}. \tag{26b} $$

The inequalities in (26a) and (26b) hold with equality with non-zero probability if and only if $\mu = 1$ and $\mu = 0$, respectively; however, this leads to a contradiction as shown in Appendix D. Therefore, only $\mathcal{M}_3$ and $\mathcal{M}_6$ are selected in the optimal strategy. Hence, we can use the notation in (24) by setting $I_k(i) = 0$, $k = 1,2,4,5$, and $I_k(i) = 1$, $k = 3,6$, $\forall i$.

In this case, the values of $t(i)$, $\forall i$, and the value of $\mu$ are found such that constraints C1 and C2 in (14) hold. Note that when C1 and C2 hold, the following also holds

$$ \bar{R}_{1r} + \bar{R}_{2r} = \bar{R}_{r1} + \bar{R}_{r2}. \tag{27} $$

Now, since $\Lambda_3(i)$ and $\Lambda_6(i)$ are the only selection metrics for which $I_k(i) \ne 0$, i.e., $k = 3,6$, $\forall i$, we observe that both $\Lambda_3(i)$ and $\Lambda_6(i)$, and both the left and right hand sides of (27), are independent of the value of $t(i)$, $\forall i$. Thus, we can set $\mu$ independently of $t(i)$ such that (27) is satisfied. We note that the optimal $\mu$ is unique since increasing $\mu$ increases the right hand side of (27) and decreases the left hand side simultaneously. Since the sum rate does not depend on $t(i)$, i.e., $\bar{R}_{1r} + \bar{R}_{2r} = \frac{1}{N}\sum_{i=1}^{N} q_3(i)\, C_r(i)$, any choice of $t(i)$, $\forall i$, that satisfies C1 in (14) is optimal. Then, if C1 in (14) and (27) hold, C2 in (14) also holds. As a result, we satisfy both C1 and C2 by satisfying C1 in (14) and (27). Therefore, we assume $t(i) = t$, $\forall i$, and $t$ is given by

$$ t = \frac{E\{q\,(C_r - C_{2r})\} - E\{(1 - q)\, C_{r2}\}}{E\{q\,(C_r - C_{1r} - C_{2r})\}} \tag{28} $$

where

$$ q = q_3^*(i) = \begin{cases} 1, & \text{if } \dfrac{C_r(i)}{C_{r1}(i) + C_{r2}(i)} \ge \dfrac{\mu}{1 - \mu} \\ 0, & \text{otherwise.} \end{cases} \tag{29} $$
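The procedure of Case 1 can be sketched numerically as follows, with expectations replaced by sample averages. This is a minimal illustration under stated assumptions, not the authors' implementation: the function name `case1_thresholds`, the tuple layout of `samples`, and the use of bisection to find $\mu$ are choices made here. Bisection is applicable because, as noted above, the two sides of (27) move in opposite directions as $\mu$ increases.

```python
def case1_thresholds(samples, iters=60):
    """samples: list of per-slot tuples (c1r, c2r, cr, cr1, cr2).

    Returns (mu, t, q): the threshold balancing (27), the time-sharing
    factor from (28), and the per-slot decisions q from (29).
    """
    def uses_multiple_access(s, mu):
        # (29) rewritten without division: Lambda_3 >= Lambda_6 for mu1 = mu2 = mu.
        return (1.0 - mu) * s[2] >= mu * (s[3] + s[4])

    lo, hi = 0.0, 1.0
    for _ in range(iters):
        mu = 0.5 * (lo + hi)
        uplink = sum(s[2] for s in samples if uses_multiple_access(s, mu))
        downlink = sum(s[3] + s[4] for s in samples if not uses_multiple_access(s, mu))
        if uplink > downlink:
            lo = mu   # too much uplink traffic: raise the threshold
        else:
            hi = mu
    mu = 0.5 * (lo + hi)
    q = [uses_multiple_access(s, mu) for s in samples]

    n = len(samples)
    # Numerator and denominator of (28) as sample averages.
    num = sum((s[2] - s[1]) if qi else -s[4] for s, qi in zip(samples, q)) / n
    den = sum(s[2] - s[0] - s[1] for s, qi in zip(samples, q) if qi) / n
    return mu, num / den, q
```

If the returned `t` falls outside the interval $(0, 1)$, the assumptions of Case 1 are not met and the sketch does not apply.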

We note that the necessary condition for the optimality of this case is that $0 < t < 1$. For the cases which lead to $t < 0$ or $t > 1$, we cannot have $\mu_1 = \mu_2$, and therefore, we obtain either $t(i) = 1$ or $t(i) = 0$. In these cases, we need different selection policies, which are investigated in the following.

Case 2: If $t(i) = 0$, then from (18c), we obtain $\phi_1(i) = 0$ and from (17c), we obtain $\phi_0(i) \ge 0$. Combining these results with (25), we obtain $\mu_1 \ge \mu_2$. Then, by substituting these results in (20), we obtain

$$
\begin{aligned}
\Lambda_3(i) &= (1 - \mu_1)\big[C_r(i) - C_{2r}(i)\big] + (1 - \mu_2)\, C_{2r}(i) \\
&= (1 - \mu_1)\, C_r(i) + (\mu_1 - \mu_2)\, C_{2r}(i) \\
&\ge \max\{\Lambda_1(i), \Lambda_2(i)\}
\end{aligned}
\tag{30a}
$$

$$ \Lambda_6(i) = \mu_2\, C_{r1}(i) + \mu_1\, C_{r2}(i) \ge \max\{\Lambda_4(i), \Lambda_5(i)\}. \tag{30b} $$

The expressions in (30a) and (30b) hold with equality with non-zero probability if and only if $\mu_1 = 1$ and $\mu_2 = 0$, respectively; otherwise, they hold with strict inequality. Therefore, we consider the following four possibilities for the thresholds $\mu_1$ and $\mu_2$.

a) If $0 < \mu_1 < 1$ and $0 < \mu_2 < 1$, then (30) holds with strict inequality and therefore, only $\mathcal{M}_3$ and $\mathcal{M}_6$ are selected. Hence, for this case, we can set $I_k(i) = 0$, $k = 1,2,4,5$, and $I_k(i) = 1$, $k = 3,6$, for $\forall i$. Since by simultaneously increasing $\mu_1$ and $\mu_2$, the left hand side of (27) decreases and the right hand side increases, we can always find a pair $(\mu_1, \mu_2)$ which satisfies this constraint. Therefore, the necessary condition for optimality in this case is that among the candidates $(\mu_1, \mu_2)$ that satisfy (27), there exists a point which satisfies either C1 or C2 in (14) (we note that since we assume (27) holds, if C1 holds, C2 must hold too and vice versa). Moreover, if such a point exists, then it is unique. To prove this, note that the candidates $(\mu_1, \mu_2)$ that satisfy (27) form a monotonically decreasing curve in the plane of $(\mu_1, \mu_2)$, since by increasing both $\mu_1$ and $\mu_2$, we simultaneously decrease the left hand side and increase the right hand side of (27). Moreover, increasing $\mu_1$ and decreasing $\mu_2$ results in decreasing $\bar{R}_{1r}$ and $\bar{R}_{r1}$, and increasing $\bar{R}_{2r}$ and $\bar{R}_{r2}$. Therefore, there exists a unique $(\mu_1, \mu_2)$ which satisfies C1 or C2 in (14); otherwise, either $\mu_1$ or $\mu_2$ reaches a boundary value, i.e., zero or one, which is discussed in the following.

b) If $\mu_1 = 1$ and $0 < \mu_2 < 1$, then from (20), we obtain

$$ \Lambda_2(i) = \Lambda_3(i) = (1 - \mu_2)\, C_{2r}(i) \ge \Lambda_1(i) \tag{31a} $$

$$ \Lambda_6(i) = \mu_2\, C_{r1}(i) + C_{r2}(i) \ge \max\{\Lambda_4(i), \Lambda_5(i)\}. \tag{31b} $$

The probability that the above expressions hold with equality is zero. Therefore, we can set $I_k(i) = 0$, $k = 1,4,5$, and $I_6(i) = 1$, $\forall i$. Moreover, we observe that constraint C2 in (14), i.e., $\bar{R}_{2r} = \bar{R}_{r1}$, only depends on $\mu_2$, and as $\mu_2$ decreases, $\bar{R}_{2r}$ increases and $\bar{R}_{r1}$ decreases. Since for $\mu_2 \to 1$ we obtain $\bar{R}_{2r} = 0 < \bar{R}_{r1}$, a sufficient and necessary condition for the existence of a $\mu_2$ such that constraint C2 in (14) holds is $P_2 > P_r$. Since both thresholds are obtained, we can determine when to select mode $\mathcal{M}_6$ or $\{\mathcal{M}_2, \mathcal{M}_3\}$. Next, we need to determine the optimal policy to select between $\mathcal{M}_2$ and $\mathcal{M}_3$ if, for some channel realizations, we obtain $\Lambda_2(i) = \Lambda_3(i) \ge \Lambda_6(i)$, such that constraint C1 in (14) holds and the maximum sum rate is achieved. Since the maximum sum rate is fixed to $E\{q_6 (C_{r1} + C_{r2})\}$ and it is only a function of $\mu_2$, different strategies of satisfying C1 in (14) yield the same sum rate. Thus, we consider a simple probabilistic approach for choosing $\mathcal{M}_3$ with probability $p_1$ and $\mathcal{M}_2$ with probability $1 - p_1$. To implement the probabilistic strategy, we set $I_3(i) = 1 - I_2(i) = X_1(i)$, where $X_1(i)$ is the outcome of a coin flip with probability $p_1 = \frac{E\{q_6 C_{r2}\}}{E\{(1 - q_6)(C_r - C_{2r})\}}$, where $p_1$ is chosen such that constraint C1 in (14) is satisfied.
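A minimal Python sketch of the coin-flip probability for this case is given next. It solves C1 for $p_1$ under the assumptions of Case 2, i.e., $t(i) = 0$ so that user 1 obtains $C_r(i) - C_{2r}(i)$ in the multiple access mode; this expression for $p_1$ is derived here from C1 and should be read as an assumption of the example. The function name and the tuple layout are hypothetical, and expectations are replaced by sample averages.

```python
def coin_probability_case_b(samples, q6):
    """samples: per-slot tuples (c2r, cr, cr2); q6: per-slot booleans,
    True when the broadcast mode M6 is selected.

    Returns p1, the probability of choosing M3 (instead of M2) in the
    remaining slots so that the rate into the relay from user 1 matches
    the rate from the relay to user 2 (constraint C1).
    """
    n = len(samples)
    # Average rate delivered to user 2 in broadcast slots.
    rate_r2 = sum(cr2 for (_, _, cr2), sel in zip(samples, q6) if sel) / n
    # Average rate user 1 could deliver if M3 were used in all other slots.
    rate_1r_max = sum(cr - c2r for (c2r, cr, _), sel in zip(samples, q6) if not sel) / n
    return rate_r2 / rate_1r_max
```

A returned value above one indicates that C1 cannot be met with these thresholds, so the conditions of this case do not hold for the given samples.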

c) If $0 < \mu_1 < 1$ and $\mu_2 = 0$, then from (20), we obtain

$$ \Lambda_3(i) = (1 - \mu_1)\big[C_r(i) - C_{2r}(i)\big] + C_{2r}(i) \ge \max\{\Lambda_1(i), \Lambda_2(i)\} \tag{32a} $$

$$ \Lambda_5(i) = \Lambda_6(i) = \mu_1\, C_{r2}(i) \ge \Lambda_4(i). \tag{32b} $$

Similar to Case b, the probability that the above expressions hold with equality is zero. Therefore, we set $I_k(i) = 0$, $k = 1,2,4$, $I_3(i) = 1$, and $I_6(i) = 1 - I_5(i) = X_2(i)$ with $p_2 = \frac{E\{q_3 C_{2r}\}}{E\{(1 - q_3)\, C_{r1}\}}$. Moreover, the necessary condition for this case is that $P_2 < P_r$, and the sum rate is $E\{q_3 C_r\}$.
d) If $\mu_1 = 1$ and $\mu_2 = 0$, then from (20), we obtain

$$ \Lambda_2(i) = \Lambda_3(i) = C_{2r}(i) \ge \Lambda_1(i) \tag{33a} $$

$$ \Lambda_5(i) = \Lambda_6(i) = C_{r2}(i) \ge \Lambda_4(i). \tag{33b} $$

The probability that the above expressions hold with equality is zero and therefore, we can set $I_k(i) = 0$, $k = 1, 4$. Moreover, one can easily conclude that the necessary condition for this case is $P_2 = P_r$; otherwise, only one of $\{\mathcal{M}_2, \mathcal{M}_3\}$ or $\{\mathcal{M}_5, \mathcal{M}_6\}$ is selected in all time slots, which violates constraints C1 and C2 in (14). Thus, we obtain $\Lambda_2(i) = \Lambda_3(i) = \Lambda_5(i) = \Lambda_6(i)$, $\forall i$. Moreover, considering that C1 and C2 in (14) hold, the sum rate is $\bar{R}_{2r} + \bar{R}_{r2} = E\{(q_2 + q_3)\, C_{2r}\} + E\{(q_5 + q_6)\, C_{r2}\} = E\{C_{2r}\} = E\{C_{r2}\}$. The maximum sum rate is achieved by a probabilistic approach as follows

$$
\begin{aligned}
I_2(i) &= X_3(i)\,[1 - X_4(i)], & I_3(i) &= X_3(i)\, X_4(i), \\
I_5(i) &= [1 - X_3(i)]\,[1 - X_5(i)], & I_6(i) &= [1 - X_3(i)]\, X_5(i),
\end{aligned}
\tag{34}
$$

where the coin flip probabilities $p_4 = \frac{1 - p_3}{p_3} \frac{E\{C_{r2}\}}{E\{C_r - C_{2r}\}}$ and $p_5 = \frac{p_3}{1 - p_3} \frac{E\{C_{2r}\}}{E\{C_{r1}\}}$ are chosen such that constraints C1 and C2 in (14) are satisfied, respectively, and probability $p_3$ is chosen to achieve the optimal sum rate. However, for the assumed values of $p_4$ and $p_5$, the sum rate is $\bar{R}_{2r} + \bar{R}_{r2} = E\{C_{2r}\} = E\{C_{r2}\}$, which is indeed the maximum sum rate. Therefore, any value of $p_3$ which results in an acceptable value of $p_4$ and $p_5$, i.e., $0 \le p_4, p_5 \le 1$, is optimal. From this condition, we obtain $\frac{E\{C_{r2}\}}{E\{C_r\}} \le p_3 \le \frac{E\{C_{r1}\}}{E\{C_{r1} + C_{2r}\}}$, where $p_3 = \frac{E\{C_{r2}\}}{E\{C_r\}}$ and $p_3 = \frac{E\{C_{r1}\}}{E\{C_{r1} + C_{2r}\}}$ lead to $p_4 = 1$ and $p_5 = 1$, respectively.
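The relation between the three coin-flip probabilities of this case can be checked with a short Python sketch. The expressions for $p_4$ and $p_5$ below are obtained by solving C1 and C2 under the assumptions of this case ($t(i) = 0$ and $P_2 = P_r$, hence $C_{2r}(i) = C_{r2}(i)$); the function names and the use of precomputed average capacities as inputs are hypothetical choices for illustration.

```python
def case_d_probabilities(e_cr, e_c2r, e_cr1, e_cr2, p3):
    """Return (p4, p5) for a given p3, where the e_* arguments are the
    average capacities E{C_r}, E{C_2r}, E{C_r1}, E{C_r2}."""
    # C1: p3 * p4 * E{C_r - C_2r} = (1 - p3) * E{C_r2}
    p4 = (1.0 - p3) / p3 * e_cr2 / (e_cr - e_c2r)
    # C2: p3 * E{C_2r} = (1 - p3) * p5 * E{C_r1}
    p5 = p3 / (1.0 - p3) * e_c2r / e_cr1
    return p4, p5


def p3_range(e_cr, e_c2r, e_cr1, e_cr2):
    """Return the interval (low, high) of p3 for which p4 <= 1 and p5 <= 1.
    The lower bound assumes E{C_2r} = E{C_r2}, as holds for P2 = Pr."""
    return e_cr2 / e_cr, e_cr1 / (e_cr1 + e_c2r)
```

For example, with average capacities $E\{C_r\} = 3$, $E\{C_{2r}\} = E\{C_{r2}\} = 1$, and $E\{C_{r1}\} = 4$ (illustrative numbers only), the admissible interval is $1/3 \le p_3 \le 4/5$, and the two endpoints give $p_4 = 1$ and $p_5 = 1$, respectively.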

Case 3: If $t(i) = 1$, then considering (18d), we obtain $\phi_0(i) = 0$ and from (17c), we obtain $\phi_1(i) \ge 0$. Combining these results with (25), we obtain $\mu_1 \le \mu_2$. Following a similar procedure as for Case 2, we obtain similar results.

C. SNR Regions

We have obtained different selection policies and the necessary conditions of optimality for each of them. Now, we determine which of the derived selection policies will be used based on the statistics of the channels and the powers of the nodes. To this end, we define three mutually exclusive SNR regions $\mathcal{R}_1$, $\mathcal{R}_2$, and $\mathcal{R}_0$. Each of the necessary conditions is realized in only one of these SNR regions, and since the SNR regions are mutually exclusive and the necessary conditions are mutually exclusive too, the selection policy for each necessary condition is optimal for the corresponding SNR region.

First, we observe that the multiple access and broadcast modes are selected with non-zero probability in all the discussed cases. However, the point-to-point modes are selected only when thresholds $\mu_1$ and $\mu_2$ have a boundary value, i.e., zero or one. By this observation, we define the SNR regions $\mathcal{R}_1$, $\mathcal{R}_2$, and $\mathcal{R}_0$ based on the utilization of point-to-point modes as follows:

SNR Region $\mathcal{R}_0$: In this SNR region, none of the point-to-point modes are selected. Moreover, the necessary conditions for the thresholds are $0 < \mu_1 < 1$ and $0 < \mu_2 < 1$.

SNR Region $\mathcal{R}_1$: In this SNR region, one or both of the point-to-point modes $\mathcal{M}_2$ and $\mathcal{M}_5$ are selected. Moreover, the necessary condition for the thresholds is $\mu_1 > \mu_2$ where at least one of the thresholds has a boundary value. Specifically, in this SNR region, if $P_2 > P_r$, we obtain $\mu_1 = 1$ and $0 < \mu_2 < 1$ and mode $\mathcal{M}_2$ is selected in some time slots, whereas if $P_2 < P_r$, we obtain $0 < \mu_1 < 1$ and $\mu_2 = 0$ and mode $\mathcal{M}_5$ is selected in some time slots, and finally if $P_2 = P_r$, we obtain $\mu_1 = 1$ and $\mu_2 = 0$ and both modes $\mathcal{M}_2$ and $\mathcal{M}_5$ can be selected in some time slots.

SNR Region $\mathcal{R}_2$: In this SNR region, one or both of the point-to-point modes $\mathcal{M}_1$ and $\mathcal{M}_4$ are selected. Moreover, the necessary condition for the thresholds is $\mu_1 < \mu_2$ where at least one of the thresholds has a boundary value. Similar to SNR region $\mathcal{R}_1$, in this SNR region, if $P_1 > P_r$, we obtain $0 < \mu_1 < 1$ and $\mu_2 = 1$ and mode $\mathcal{M}_1$ is selected in some time slots, whereas if $P_1 < P_r$, we obtain $\mu_1 = 0$ and $0 < \mu_2 < 1$ and mode $\mathcal{M}_4$ is selected in some time slots, and finally if $P_1 = P_r$, we obtain $\mu_1 = 0$ and $\mu_2 = 1$ and both modes $\mathcal{M}_1$ and $\mathcal{M}_4$ can be selected in some time slots.

Roughly speaking, in SNR region $\mathcal{R}_1$, the user 1-to-relay link is much stronger than the user 2-to-relay link, and one or both of the point-to-point modes between the relay and user 2 are selected. Similarly, in SNR region $\mathcal{R}_2$, the user 2-to-relay link is much stronger than the user 1-to-relay link, and one or both of the point-to-point modes between the relay and user 1 are selected. However, in SNR region $\mathcal{R}_0$, the quality of the user 1-to-relay link is similar to that of the user 2-to-relay link, and none of the point-to-point modes is selected. To easily distinguish the SNR regions based on the channel statistics, we investigate the transition conditions between the regions. We note that as the quality of the user 1-to-relay link improves compared to the user 2-to-relay link, the SNR region changes from $\mathcal{R}_2$ to $\mathcal{R}_0$ and then from $\mathcal{R}_0$ to $\mathcal{R}_1$. Therefore, the transition between SNR regions $\mathcal{R}_1$ and $\mathcal{R}_0$ occurs when the coin flip probabilities are equal to one, i.e., $p_1, p_2, p_4, p_5 = 1$. Then, based on whether $P_2$ is larger than, smaller than, or equal to $P_r$, we use the optimal strategy introduced in Case b, c, and d, respectively. Therefore, the SNR region is $\mathcal{R}_1$ if and only if

$$
\{P_2 > P_r \ \text{AND} \ \omega_1 < 1\} \ \ \text{OR} \ \ \{P_2 < P_r \ \text{AND} \ \omega_2 < 1\} \ \ \text{OR} \ \ \{P_2 = P_r \ \text{AND} \ \omega_3 < \omega_4\}
\tag{35}
$$

where $\omega_1 = \frac{E\{q\, C_{r2}\}}{E\{(1 - q)(C_r - C_{2r})\}}$, $\omega_2 = \frac{E\{q\, C_{2r}\}}{E\{(1 - q)\, C_{r1}\}}$, $\omega_3 = \frac{E\{C_{r2}\}}{E\{C_r\}}$, and $\omega_4 = \frac{E\{C_{r1}\}}{E\{C_{r1} + C_{2r}\}}$, and $q$ is defined as

$$
q = \begin{cases}
1, & \text{if } \big\{P_2 > P_r \ \text{AND} \ [\Lambda_3(i) \le \Lambda_6(i)]_{\mu_1 = 1,\, \mu_2 = \mu^*}\big\} \\
 & \text{OR } \big\{P_2 < P_r \ \text{AND} \ [\Lambda_3(i) \ge \Lambda_6(i)]_{\mu_1 = \mu^*,\, \mu_2 = 0}\big\} \\
0, & \text{otherwise}
\end{cases}
\tag{36}
$$

where, if $P_2 > P_r$, the threshold $\mu^*$ is chosen such that $E\{(1 - q)\, C_{2r}\} = E\{q\, C_{r1}\}$ holds, whereas, if $P_2 < P_r$, the threshold $\mu^*$ is chosen such that $E\{q\,[C_r - C_{2r}]\} = E\{(1 - q)\, C_{r2}\}$ holds. In a similar manner, one can distinguish between SNR regions $\mathcal{R}_2$ and $\mathcal{R}_0$. We note that all conditions of optimality for these three exclusive SNR regions are obtained in this section, and the thresholds $\mu_1$ and $\mu_2$ which result in the maximum sum rate are shown to be unique. Thus, the proposed solution is optimal. This completes the proof.
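The region test in (35) amounts to a three-way comparison, which the following Python sketch makes explicit. The function name and argument order are hypothetical, the quantities $\omega_1, \dots, \omega_4$ are assumed to have been computed beforehand from the channel statistics, and the strict inequalities follow the form of (35) as read here.

```python
def is_region_r1(p2, pr, omega1, omega2, omega3, omega4):
    """Return True if the statistics place the network in SNR region R1,
    following the three alternatives of (35)."""
    if p2 > pr:
        return omega1 < 1.0       # Case b: mode M2 is used in some slots
    if p2 < pr:
        return omega2 < 1.0       # Case c: mode M5 is used in some slots
    return omega3 < omega4        # Case d: a valid p3 interval exists
```

Only the $\omega$ values relevant to the ordering of $P_2$ and $P_r$ are inspected; the remaining arguments may be passed as placeholders.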

Appendix C
Proof of Binary Relaxation

In this appendix, we prove that the optimal solution of the problem with the relaxed constraints, $0 \le q_k(i) \le 1$, can also be achieved if the values of $q_k(i)$ are binary, i.e., if $q_k(i) \in \{0, 1\}$. Therefore, the binary relaxation does not change the maximum sum rate. We start by noting that if one of the $q_k(i)$, $k = 1, \dots, 6$, adopts a non-binary value in the optimal solution, then in order for constraint C3 in (4) to be satisfied, there has to be at least one other non-binary selection variable in that time slot. Assuming that the mode indices of the non-binary selection variables are $k'$ and $k''$ in the $i$-th time slot, we obtain $\alpha_k(i) = 0$, $k = 1, \dots, 6$, from (18a), and $\beta_{k'}(i) = 0$ and $\beta_{k''}(i) = 0$ from (18b). Then, by substituting these values into (19), we obtain

$$ \lambda(i) = \Lambda_{k'}(i) \tag{37a} $$

$$ \lambda(i) = \Lambda_{k''}(i) \tag{37b} $$

$$ \lambda(i) - \beta_k(i) = \Lambda_k(i), \quad k \ne k', k''. \tag{37c} $$

From (37a) and (37b), we obtain $\Lambda_{k'}(i) = \Lambda_{k''}(i)$, and by subtracting (37c) from (37a) and (37b), we obtain

$$ \Lambda_{k'}(i) - \Lambda_k(i) = \beta_k(i), \quad k \ne k', k'' \tag{38a} $$

$$ \Lambda_{k''}(i) - \Lambda_k(i) = \beta_k(i), \quad k \ne k', k''. \tag{38b} $$



May 1, 2013 



DRAFT 



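[Figure 4: partition of the $(\mu_1, \mu_2)$ plane by the lines $\mu_1 = 1$ and $\mu_2 = 1$, with each region labeled by its set of candidate modes, e.g., $\{\mathcal{M}_1, \mathcal{M}_4\}$, $\{\mathcal{M}_2, \mathcal{M}_6\}$, and $\{\mathcal{M}_2, \mathcal{M}_5\}$.]

Fig. 4. Modes with non-negative powers in the space of $(\mu_1, \mu_2)$.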



From the dual feasibility condition given in (17b), we have $\beta_k(i) \ge 0$, which leads to $\Lambda_{k'}(i) = \Lambda_{k''}(i) \ge \Lambda_k(i)$. However, as a result of the randomness of the time-continuous channel gains, $\Pr\{\Lambda_{k'}(i) = \Lambda_{k''}(i)\} > 0$ holds for some transmission modes $\mathcal{M}_{k'}$ and $\mathcal{M}_{k''}$ if and only if the optimal value of either $\mu_1$ or $\mu_2$ is at a boundary, i.e., $\mu_1 = 0, 1$ or $\mu_2 = 0, 1$. In these cases, we observe that $\Lambda_{k'}(i) = \Lambda_{k''}(i)$, $\forall i$, and we have shown in Appendix B that for $\mu_1 = 0, 1$ and $\mu_2 = 0, 1$, we can obtain the optimal sum rate by using a probabilistic approach to choose $\mathcal{M}_{k'}$ with probability $p$ and $\mathcal{M}_{k''}$ with probability $1 - p$, which results in a binary value for the selection variables $q_{k'}(i)$ and $q_{k''}(i)$. This completes the proof.

Appendix D
Threshold Regions

In this appendix, we determine the interval which contains the optimal values of $\mu_1$ and $\mu_2$. Fig. 4 represents the set of candidate selection modes for all channel realizations in the space of $(\mu_1, \mu_2)$. These candidate modes are obtained based on the necessary selection condition introduced in (23) as:

1) If $\mu_1 < 1$ and $\mu_2 > 1$, then $\mathcal{M}_1$ is a candidate for selection since we obtain $\Lambda_1(i) \ge \max\{\Lambda_2(i), \Lambda_3(i)\}$ from (20).

2) If $\mu_1 > 1$ and $\mu_2 < 1$, then $\mathcal{M}_2$ is a candidate for selection since we obtain $\Lambda_2(i) \ge \max\{\Lambda_1(i), \Lambda_3(i)\}$ from (20).

3) If $\mu_1 < 1$ and $\mu_2 < 1$, then, based on (26a) and (30a), $\mathcal{M}_3$ is a candidate for selection since we obtain $\Lambda_3(i) \ge \max\{\Lambda_1(i), \Lambda_2(i)\}$ from (20).

4) If $\mu_1 < 0$ and $\mu_2 > 0$, then $\mathcal{M}_4$ is a candidate for selection since we obtain $\Lambda_4(i) \ge \max\{\Lambda_5(i), \Lambda_6(i)\}$ from (20).

5) If $\mu_1 > 0$ and $\mu_2 < 0$, then $\mathcal{M}_5$ is a candidate for selection since we obtain $\Lambda_5(i) \ge \max\{\Lambda_4(i), \Lambda_6(i)\}$ from (20).

6) If $\mu_1 > 0$ and $\mu_2 > 0$, then $\mathcal{M}_6$ is a candidate for selection since we obtain $\Lambda_6(i) \ge \max\{\Lambda_4(i), \Lambda_5(i)\}$ from (20).
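The six conditions above can be collected into a small lookup, sketched here in Python. The function name is hypothetical, and boundary values of the thresholds (exactly zero or one) are deliberately not handled, since in that situation neighboring modes also become candidates.

```python
def candidate_modes(mu1, mu2):
    """Return the set of candidate mode indices for thresholds (mu1, mu2),
    following conditions 1) to 6); valid away from the boundary values."""
    modes = set()
    if mu1 < 1 and mu2 > 1:
        modes.add(1)   # user 1 to relay
    if mu1 > 1 and mu2 < 1:
        modes.add(2)   # user 2 to relay
    if mu1 < 1 and mu2 < 1:
        modes.add(3)   # multiple access
    if mu1 < 0 and mu2 > 0:
        modes.add(4)   # relay to user 1
    if mu1 > 0 and mu2 < 0:
        modes.add(5)   # relay to user 2
    if mu1 > 0 and mu2 > 0:
        modes.add(6)   # broadcast
    return modes
```

Evaluating this sketch at interior thresholds such as $(0.5, 0.5)$ yields the pair of multiple access and broadcast modes, while points outside the unit square always lose at least one of the two directions of transmission.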

It is straightforward to see that all sets in Fig. 4 contradict one or both constraints C1 and C2 in (4), except $\{\mathcal{M}_3, \mathcal{M}_6\}$, which corresponds to $0 < \mu_1 < 1$ and $0 < \mu_2 < 1$. Note that if $\mu_1$ and $\mu_2$ are at the boundaries, i.e., $\mu_1 = 0, 1$ and $\mu_2 = 0, 1$, then the neighboring modes are also candidates for selection.

We note that $\mu_1$ and $\mu_2$ cannot be simultaneously equal to zero since this results in $\Lambda_4(i) = \Lambda_5(i) = \Lambda_6(i) = 0$, $\forall i$, and thus only the transmission from the users to the relay will be adopted, which contradicts constraints C1 and C2 in (4). Similarly, $\mu_1$ and $\mu_2$ cannot be simultaneously equal to one since this also contradicts constraints C1 and C2 in (4).